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Abstract — Delay tolerant Networks (DTNs) leverage the mo- 
bility of relay nodes to compensate for lack of permanent 
connectivity and thus enable communication between nodes that 
are out of range of each other. To decrease message delivery delay, 
the information to be transmitted is replicated in the network. 
We study replication mechanisms that include Reed-Solomon 
type codes as well as network coding in order to improve the 
probability of successful delivery within a given time limit. We 
propose an analytical approach that allows us to compute the 
probability of successful delivery. We study the effect of coding 
on the performance of the network while optimizing parameters 
that govern routing. 

Index Terms — Delay Tolerant Networks, Optimal Scheduling, 
Coding, Network Codes 

I. Introduction 

DTNs exploit random contacts between mobile nodes to 
allow end-to-end communication between points that do not 
have end-to-end connectivity at any given instant. The contacts 
between any two nodes may be quite rare, but still, when 
there are sufficiently many nodes in the system, the timely 
delivery of information to the destination may occur with high 
probability. This is obtained at the cost of many replicas of 
the original information, a process which requires energy and 
memory resources. Since many relay nodes (and thus network 
resources) may be involved in ensuring successful delivery, 
it becomes crucial to design efficient resource allocation and 
data storage protocols. In this paper we address this combined 
problem. The basic data unit that is transferred or stored is 
called a frame, and to transfer successfully a file, all frames 
of which it is composed are needed at the destination. We 
consider both energy costs as well as memory constraints: The 
memory of a DTN node is assumed to be limited to the size 
of a single frame. We study adding coding in order to improve 
the storage efficiency of the DTN. We consider Reed-Solomon 
type codes as well as network coding. The basic questions are 
then: (i) transmission policy: When the source is in contact 
with a relay node, should it transmit a frame to the relay? 
(ii) scheduling: If yes, which frame should a source transfer? 
Each time the source meats a relay node, it chooses a frame i 
for transmission with probability u % . In a simple scenario, the 
source has initially all the frame and u % are fixed in time. It 
was shown in [2] that the transmission policy has a threshold 
structure: use all opportunities to spread frame till some time 
a and then stop (this is similar to the "spray and wait" policy 
[13]). Due to convexity arguments it turns out that the optimal 
u l does not depend on i [2]. In this paper we assume a general 
arrival process of frames: they need not become available 



for transmission simultaneously at time zero as in [2]. We 
further consider dynamic scheduling: the probabilities u l may 
change in time. We define various performance measures and 
solve various related optimization problems. Surprisingly, the 
transmission does not follow anymore a threshold policy (in 
contrast with [2]). We extend these results to include also 
coding, and show that all performance measures improve 
when increasing the amount of redundancy. We then study the 
optimal transmission under network coding. Related Work 
The works [7] and [14] describe the technique to erasure code 
a file and distribute the generated code-blocks over a large 
number of relays in DTNs. The use of erasure codes is meant 
to increase the efficiency of DTNs under uncertain mobility 
patterns. In [14] the performance gain of the coding scheme 
is compared to simple replication, i.e., when additional copies 
of the same file are released. The benefit of erasure coding is 
quantified by means of extensive simulations and for different 
routing protocols, including two-hops routing. In [7], the au- 
thors address the case of non-uniform encounter patterns, and 
they demonstrate strong dependence of the optimal successful 
delivery probability on the way replicas are distributed over 
different paths. The authors evaluate several allocation tech- 
niques; also, the problem is proved to be NP-hard. The paper 
[5] proposes general network coding techniques for DTNs. In 
[8] ODE based models are employed under epidemic routing; 
in that work, semi-analytical numerical results are reported 
describing the effect of finite buffers and contact times; the 
authors also propose a prioritization algorithm. The paper [15] 
addresses the design of stateless routing protocols based on 
network coding, under intermittent end-to-end connectivity. A 
forwarding algorithm based on network coding is specified, 
and the advantage over plain probabilistic routing is proved 
when delivering multiple frames. Finally, [4] describes an 
architecture supporting random linear coding in challenged 
wireless networks. The structure of the paper is the following. 
In Sec.|n]we introduce the network model and the optimization 
problems tackled in the paper. Sec. [Ill] and Sec. [IV] describe 
optimal solutions in the case of work conserving and not-work 
conserving forwarding policies, respectively. Sec. IVl addresses 
the case of energy constraints. Sec. [Vl] deals with erasure 
codes. Rateless coding techniques are presented in Sec. IVIII 
The use of network coding is addressed in Section IVIIII 
Sec. IIXI concludes the paper. 

II. The model 

The main symbols used in the paper are reported in Tab. Q] 
Consider a network that contains N + 1 mobile nodes. We 
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TABLE I 

Main notation used throughout the paper 



whose solution is 



Symbol 



N 
K 
11 
A 

Xi(t) 

x it x 

z 

Ui(t) 

u 

Zi{t),Zi 

A(r) 
Ps{r) 



Meaning 



number of nodes (excluding the destination) 
number of frames composing the file 
number of redundant frames 
inter-meeting intensity 
timeout value 

number of nodes having frame i at time t (excluding the 

destination) 

summation JA Xi(t) 

corresponding sample paths 

:=X(0) will be taken unless otherwise stated. 

forwarding policy for frame i; u = (ui, 112, . . . , uk) 

sum of the UjS 

Zi{t) = Jo Xi(u)du, Zi = Zi(r), Z(t) = 
(Z^t), Z 2 (t), . . .), Z = Z(r), Z = £ ^ 
probability of successful delivery of frame i by time r 
probability of successful delivery of the file by time r; 
P s (t, X, i? ) is used to stress the dependence on K and H 
nonnegative real numbers 



X{t) = l + (z-l)e 



~ X Jo H r ) dr 



, X(0) = z (3) 



assume that two nodes are able to communicate when they 
come within reciprocal radio range and communications are 
bidirectional. We also assume that the duration of such con- 
tacts is sufficient to exchange all frames: this let us consider 
nodes meeting times only, i.e., time instants when a pair of not 
connected nodes fall within reciprocal radio range. Also, let 
the time between contacts of pairs of nodes be exponentially 
distributed with given inter-meeting intensity A. The validity 
of this model been discussed in [6], and its accuracy has been 
shown for a number of mobility models (Random Walker, 
Random Direction, Random Waypoint). A file contains K 
frames. The source of the file receives the frames at some 
times ti < t<z < ... < tx- U are called the arrival times. We 
assume that the transmitted file is relevant during some time r. 
By that we mean that all frames should arrive at the destination 
by time ti + t. Furthermore, we do not assume any feedback 
that allows the source or other mobiles to know whether the 
file has made it successfully to the destination within time r. 
If at time t the source encounters a mobile which does not 
have any frame, it gives it frame i with probability Ui(t). We 
assume that u = 1 where u = Ui{t). There is an obvious 

constraint that Ui(t) = for t < ti. Let X(t) and X(f) be 

the n dimensional vectors whose components are Xi{t) and 

Xi(t). Here, Xi(i) stand for the fraction of the mobile nodes 
(excluding the destination) that have at time t a copy of frame 

i, and Xi(t) the expectation of Xi(t). 

A. Dynamics of the expectation 

Let X(t) = X)j=i Xi(t)- The dynamics of Xi is given by 

dXi(t) 



dt 



= Ui(t)X(l-X{t)) 



(1) 



Taking the sum over all i, we obtain the separable differential 
equation 

dX(t) 



where z is the total initial number of frames at the system at 
time t = 0. Thus, Xi(t) is given by the solution of 

= -Ui(t)X( Z - l) e -^o-(r)dr (4) 

Unless otherwise stated, we shall assume throughout z = 0. 



B. Performance measures and optimization 

In the following we will use fluid approximations for 
deriving optimal control policies that the source can use to 
maximize the file delivery probability. Denote by D{t) the 
probability of a successful delivery of all K frames by time 
r. Define the random variable D(r|?x) as the successful 
delivery probability conditioned on X, where 5Fx is the natural 
filtration of the process X [3]. We have 



E[D k (t\5x)]=E 



K 



J(l - exp(-A^)) 



(5) 



where = J Q r Xi(s)ds. We shall consider the asymptotics 
as N becomes large yet keeping the total rate A of contacts a 
constant (which means that the contact rate between any two 

individuals is given by A = X/N). Using strong laws of large 

numbers, we get limjv^oo Zi(N) = E[Zi\ a.s. Observe that 
since eq. (0 is bounded, using the Dominated Convergence 
Theorem, we obtain 



K 

P s {t) = lim E[D k {t\?x,N)} = TT(1 - exp(-XE[Zi])) 

N— >oo 

.Also, the expected delivery time (i.e. the time needed to 
transmit the whole file) is given by 



/>oo 

E[D] = / (l-P s (T))dr 
Jo 



dt 



Xu(l-X(t)) 



(2) 



Definition 2.1: We define u to be a work conserving policy 
if whenever the source meets a node then it forwards it a 
frame, unless the energy constraint has already been attained. 
We shall study the following optimization problems: 

• PI. Find u that maximizes the probability of successful 
delivery till time r. 

• P2. Find u that minimizes the expected delivery time over 
the work conserving policies. 

Definition 2.2: An optimal policy u is called uniformly 
optimal for problem PI if it is optimal for problem PI for 
all t > 0. 
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Energy Constraints: Denote by £(t) the energy consumed 
by the whole network for transmitting and receiving a file 
during the time interval [0, t]. It is proportional to X(t) — X(0) 
since we assume that the file is transmitted only to mobiles 
that do not have the file, and thus the number of transmissions 
of the file during [0, t] plus the number of mobiles that had it at 
time zero equals to the number of mobiles that have it. Also, let 
e > be the energy spent to forward a frame during a contact 
(notice that it includes also the energy spent to receive the file 
at the receiver side). We thus have £(i) = e(X(t) - X(0)). 
In the following we will denote x as the maximum number of 
copies that can be released due to energy constraint. Introduce 
the constrained problems CP1 and CP2 that are obtained from 
problems PI and P2 by restricting to policies for which the 
energy consumption till time r is bounded by some positive 
constant. 

III. Optimal scheduling 

A. An optimal equalizing solution 

Theorem 3.1: Fix r > 0. Assume that there exists some 
policy u satisfying i\^f =1 u\ = 1 for all t and JJ" Xi(t)dt is 
the same for all i's. Then u is optimal for PI. 

Proof. Define the function £ over the real numbers: ((h) = 
1 — exp(— Xh). Denote Z = (Z\, . . . , Zk) such that Z{ = 
Jq Xi(v)dv. We note that £ is concave in h and that 

logP s (r, u) = J^iLi l°g(C(^i))- It tnen follows from Jensen's 
inequality, that the success probability when using u satisfies 



logP s (r,u) < K log (C(Z/K)) 



(6) 



where Z = Y^f=i an d w i tn equality if Zj are the same for 



all i's. This implies the Theorem. 



oNot always 



it will be possible to equalize the above integrals. A policy u 
which is optimal among the work conservative policies will 
be obtained by making them as equal as possible in a sense 
that we define next. 

B. Schur convexity Majorization 

Definition 3.1: (Majorization and Schur-Concavity [11]) 
Consider two n-dimensional vectors d(2). d(2) majorizes 
d(l), which we denote by d(l) -< d(2), if 

k k 

< E d w( 2 )< k = l,...,n-l, (7) 

i=l i=l 
n n 

and £d M (l) = X)d M (2), (8) 

i=l i=l 

where d[{\(m) is a permutation of d,(m) satisfying dm(rn) > 
d[2]{ m ) > ■■■ > dr n i(m), m = 1,2. A function / : R n — > R 
is Schur concave if -< d(2) implies f(d(l)) > f{d(2)). 

Lemma 3.1: [11, Proposition C.l on p. 64] Assume that 
a function g : R n — > R can be written as the sum g[d) = 



SILi ^(di) where ift is a concave function from R to R. 
Then g is Schur concave. 

Theorem 3.2: log P s (r, u) is Schur concave in Z = 
(Z 1 ,...,Z K ). Hence if Z -< Z' then P s (r,u) > P s (r,u'). 

C. 77ze case K = 2. 

Consider the case of If = 2. Let the system be empty at 
time 0, i.e., z = 0, and let ti = 0. Consider the policy that 
transmits always frame 1 during t G [ti,t2]> an d from time ti 
onwards it transmits only frame 2. Then 

X M - / X ^ < t < t 2 
X(t 2 ) t 2 <t<r 

where X(t) = 1 — exp(— At). Also, 



-Ai 



< t < t 2 

U<t<T 



This gives 

Xi{t)dt 



-1 + At 2 + e 



-At 2 



A 



+ (r-i 2 )(l-e- At ») 



-At 2 



X 2 (t)dt= — 



-(A(r-f 2 )-l + e- A(r - t2 )) 



We compute the value of r for which /J" Xi(t)dt = 
X2,{t)dt. We denote by t eq the solution. We obtain (almost 
instantaneous with Maple 9.5^0 



teg- A 



LambertW ( — 



exp(0 



and where ^ 



1 -2exp(-Ai 2 ) 
1 + 2e- xu - + 2\t 2 e~ xt 



1 - 2e 



-At 2 



Then we have the following. 



Theorem 3.3: (i) Assume that r < £ eg . Then there is 
no work conserving policy that equalizes C X\{t)dt = 

X 2 (t)dt. Thus there is no optimal work conserving optimal 
for PI. 

(ii) Assume that r = t eq . Consider the policy u' that transmits 
always frame 1 during t £ [t±, t 2 ), then transmits always frame 
2 during time t G [t 2 ,r). Then this work conserving policy 
achieves J Q r X\(t)dt — J Q r X 2 (t)dt and is thus optimal for PI. 

(iii) Assume now r > t eq . Consider the work conserving 
policy u* that agrees with u' (defined in part ii) till time t eq 
and from that time onwards uses u\ — u 2 = 0.5. Then again 
/J" Xi(t)dt = JJ" X 2 (t)dt and u* is thus optimal for PI. 

oNote that the same policy u* is optimal for PI for all 
horizons long enough, i.e., whenever r > t eq as u* equalizes 
C X\(t)dt = Jq X 2 (t)dt for all values of t > t eq , because 
u\ = u 2 = 0.5. Moreover, we have 

Theorem 3.4: The work conserving policy u* described at 
(ii) in Thm. 13.31 is uniformly optimal for problem P2. 



'LambertW below is known as the inverse function of f(w) = uiexp(ui) 
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TABLE II 
Algorithm A 



Al Use pt = ei at time t £ [ti,<2)- 

A2 Use pt = e2 from time t2 till s(l,2) = 
min(5(2, {1, 2}),i 3 ). If s(l,2) < t 3 then switch to 



Pt = i(ei + e2) till time {3. 



. K; 



A3 Define tfc+i = T. Repeat the following for i = 3, . 
A3.1 Set j = i. Set s(i, j) = ti 

A3.2 Use p t = J2l=j e fc from dme s (*'i) tin 

s(i,i-l) : =min(S(i,{l,2,...,j}),t i+1 ).If j = 1 
then end. 

A3. 3 If s(i,j — 1) < then take j = min(j : j S 

J(t, {1, ...,»})) and go to step [A3. 2]. 




Fig. 1. Success probability under 
non work conserving policy u(s) as a 
function of s for A = 1, 3, 8, 15; top 
curve corresponds to largest value of 
A; second top corresponds to second 
largest A etc. (this order changes only 
at s very close to 0.5). 



Fig. 2. The evolution of X(t) 
as a function of t under the best 
work conserving policy for A = 
1,3, 8, 15. The curves are ordered 
according to A with the top curve 
corresponding to the largest A etc. 



Proof. The policy u* is work conserving. By construction, for 
any work conserving policy u', Z(t) -< Z'(t). The optimality 
then follows from Theoreml3.2l o 



D. Constructing an optimal work conserving policy 

We propose an algorithm that has the property that it gener- 
ates a policy u which is optimal not just for the given horizon 
t but also for any horizon shorter than r. Yet optimality 
here is only claimed with respect to work conserving policies. 
Definitions: 

• Zj(t) := J^Xj(r)dr. We call Zj(t) the cumulative 
contact intensity (CCI) of class j. 

• I(t, A) := mmj £ A(Zj, Zj > 0). This is the minimum 
non zero CCI over j in a set A at time t. 

• Let J(t, A) be the subset of elements of A that achieve 
the minimum I(t, A). 

• Let S(i,A) := sup(i : i £ J(t,A)). 

• Define e.; to be the policy that sends at time t frame of 
type i with probability 1 and does not send frames of 
other types. 

Recall that t\ < t 2 < ... < tx are the arrival times of frames 
1, K . Consider the Algorithm A in Table [XT] Algorithm A 
seeks to equalize the less populated frames at each point in 
time: it first increases the CCI of the latest arrived frame, trying 
to increase it to the minimum CCI which was attained over 
all the frames existing before the last one arrived (step A3. 2). 
If the minimum is reached (at some threshold s), then it next 
increases the fraction of all frames currently having minimum 
CCI, seeking now to equalize towards the second smallest CCI, 
sharing equally the forwarding probability among all such 
frames. The process is repeated until the next frame arrives: 
hence, the same procedure is applied over the novel interval. 
Notice that, by construction, the algorithm will naturally 
achieve equalization of the CCIs for t large enough. Moreover, 
it holds the following: 

Theorem 3.5: [See Appendix] Fix some r. Let u* be the 
policy obtained by Algorithm A when substituting there r = 
00. Then 

(i) u* is uniformly optimal for P2. 

(ii) If in addition X l {t)dt are the same for all i's, then u* 
is optimal for PI. 



IV. Beyond work conserving policies 

We have obtained the structure of the best work conserving 
policies, and identified their structure, and identified cases in 
which these are globally optimal. We next show the limitation 
of work-conserving policies. 

A. The case K=2 

We consider the example of Section UlI-CI but with r < t eq . 
Consider the policy u(s) where = ii < s < t.2 which trans- 
mits type-1 frames during [ii,s), does not transmit anything 
during [s, t-i) and then transmits type 2 frames after £2- It then 
holds 

X(t) 0<t<s 

X(s) S <t<T 



X 1 (t) = 



where X(t) = 1 — exp(— At). Also, 

< t < t 2 

X 2 (t)={ X(t-(t 2 -s))-X(s) = 



e -„ s _ e -x(t-(t 2 -s)) u<t< 



2 St St 



This gives 



— 1 + As 4- p~ A;s 

Xl (t)dt = 1 + AS J + (r - - e- As ) 



A 



X 2 (t)dt = ^— (A(r - t 2 ) - 1 + e-^-^) 



A 



Example 4.1: Using the above dynamics, we can illustrate 
the improvement that non work conserving policies can bring. 
We took r = 1, t± = 0, t 2 = 0.8. We vary ,s between 
and t,2 and compute the probability of successful delivery for 
A = 1,3, 8 and 15. The corresponding optimal policies u(s) 
are given by the thresholds s = 0.242,0.242,0.265,0.425. 
The probability of successful delivery under the threshold 
policies u(s) are depicted in Figure Q] as a function of ,s 
which is varied between and t 2 - In all these examples, there 
is no optimal policy among those that are work conserving. 
A work conserving policy turns out to be optimal for all 
A < 0.9925. Note that under any work conserving policy, 
/ T X 2 (t)dt < t(1 - X{t 2 )) (where X(t 2 ) is the same for 
all work conserving policies). Now, as A increases to infinity, 
X(t 2 ) and hence Xi(t 2 ) increase to one. Thus /J" X 2 (t) tends 
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to zero. We conclude that the success delivery probability tends 
to zero, uniformly under any work conserving policy. 
Recall that Theorem B .3 I provided the globally optimal policies 
for t eq < t for K = 2. The next Theorem completes the 



derivation of optimal policies for K 

T. 



2 by considering t eq > 



2 with t eq > t, 



Theorem 4.1: [See Appendix] For K 
there is an optimal non work-conserving threshold policy 
u*(s) whose structure is given in the beginning of this 

subsection. The optimal threshold is given by s = j log ^1 — 
A(r-t 2 ) ) Any other policy that differs from the above on 



a set of positive measure is not optimal. 

B. Time changes and policy improvement 

Lemma 4.1: Let p < 1 be some positive constant. For 
any multi-policy u = {ui(t), u n (t)} satisfying u = 
2~27=i u i{t) < P f° r a U *> define the policy v = {v\, —,v n } 
where Vi = Ui(t/p)/p or equivalently, Ui = pv.i(tp), i = 
l,...,n. Define by Xi the state trajectories under u, and let 
Xi be the state trajectories under v. Then X(t) = X(tp). 
Proof. We have 



dX(s) 
ds 



v(s)X(l-X(s)) 



where v = Yh—i v i- Substituting s = tp we obtain 



dX(s) dX(s) 



dt 



d(s) 



pv(s)\(l - X(s)) = u(t)A(l - X(s))) 



We conclude that X (t) = X(tp). Moreover, 



dXi{s) _ dX t ( S ) 



dt 



= P- 



ds 



:pVi{s)\(l-X{s)) = Ui (t)X(l-X(s)) 



We thus conclude that Xi(t) = Xi{tp) for all i. oThe control 
v in the Lemma above is said to be an accelerated version 
of u from time zero with an accelerating factor of 1/p. An 
acceleration v of u from a given time t 1 is defined similarly 
as Vi(t) = ui(t) for t < t' and v t {t) = u % (t' + (t - t')/p)/p 
otherwise, for all i = 1, ...,n. We now introduce the following 
policy improvement procedure. 

Definition 4.1: Consider some policy u. and let u := 
2j=i u j{t)- Assume that u < p over some < p < 1 for 
all t in some interval S — [a,b] and that f?u(t)dt > for 
some c > b. Let w be the policy obtained from u by 

(i) accelerating it at time b by a factor of 1/p, 

(ii) from time d := a + p(b — a) till time c — (1 — p){b — a), 
use w(t) = u(t + b — d). Then use w(t) = till time c. 

Let X{t) be the state process under u, and let X(t) be the 
state process under w. Then 

Lemma 4.2: Consider the above policy improvement of u 
by w. Then 

(a) Xi(t) > Xi(t) for all < t < c, 



TABLE III 
Algorithm B 



Bl Use pt = utei at time t G [ii,t2)- 

B2 Use pt = u t e 2 from time t 2 . till min(S(2, {1, 2}), t 3 ). If 
S(2,{1, 2}) < t 3 then switch to p t = |(ei + e 2 )«t till 
time 43. 

B3 Define tjf+i = T - Repeat the following for i = 3, ...,K: 
B3.1 Set j = i. Set s(i, j) = ti 

B3.2 Use pt = J2k=j e k u t from time s(i,j) till 

s{i,j-l) := mm(S(j,{l,2,...,i}),t l+1 ). If j = 1 
then end. 

B3.3 If s(i,j — 1) < ti-^i then take j = min(j : j G 



J(t, {1, i})) and go to step [B3.2]. 



(b) Xi(c) =Xi(c) for all i, 

(c) J^X^dt^ %Xi(t)dt. 

C. Optimal policies for K > 2. 

Theorem 4.2: Let K > 2. Then an optimal policy exists 
with the following structure: 

• (i) There are thresholds, Sj G , , i = l,...,K. 
During the intervals [si,ti + i) no frames are transmitted. 

• (ii) Algorithm B to decide what frame is transmitted at 
the remaining times. 

• (iii) After time tx it is optimal to always transmit a frame. 
An optimal policy u satisfies u(t) = 1 for all t >tn (it 
may differ from that only up to a set of measure zero). 

Proof, (i) Let u be an arbitrary policy. Define u(t) = 
l~2jUj(t). Assume that it does not satisfy (i) above. Then 
there exists some i = l,...,K — 1, such that u(t) is not a 
threshold policy on the interval := [ij,£j_|_i). Hence there 
is a close interval S = [a, b] C Ti such that for some p < 1, 

u (t) < P for all t 6 S and J fc * 1+1 u(t)dt > 0. Then u can be 
strictly improved according to Lemma 14.21 and hence cannot 
be optimal. 

(iii) By part (i) the optimal policy has a threshold type on 
the interval [fjf, tjf+i]. Assume that the threshold s satisfies 
s < tK+\- It is direct to show that by following u till time s 
and then switching to any policy that satisfies Ui(t) > for 
all i, P s (t) strictly increases. o 

V. THE CONSTRAINED PROBLEM 

Let u be any policy that achieves the constraint £(t) = ex 
as defined in Section lTl-BI We make the following observation. 
The constraint involves only X(t). It thus depends on the 
individual Xi(t)'s only through their sum; the sum X(t), in 
turn, depends on the policies Ui's only through their sum 

u = Yld=i Ui - Work conserving policies. Any policy which 
is not a threshold one can be strictly improved as described in 
Lemma 14.21 Consider the case of work conserving policies. 
Then the optimal policy is of a threshold type [1]: u = 1 
till some time s and is then zero, s is the solution of 
X(s) = z + x, i.e. 



-log 



1 — x — z 
1 - z 
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Algorithm A can be used to generate the optimal policy 
components Ui(t), i — 1, . . . , K. General policies Any policy 
u that is not of the form as described by (i)-(ii) in Theorem 
14.21 can be strictly improved by using Lemma 14.21 Thus the 
structure of the optimal policies is the same, except that (iii) 
of Theorem 14.21 need not to hold. 

VI. Adding Fixed amount of Redundancy 

We now consider adding forward error correction: we add H 
redundant frames and consider the new file that now contains 
K + H frames. Under an erasure coding model, we assume 
that receiving K frames out of the K + H sent ones permits 
successful decoding of the entire file at the receiver. Let S n:P 
be a binomially distributed r.v. with parameters n and p, i.e., 
P(Sn, P = m) = B(p,n,m) := ("> m (l - p) n ~ m The 
probability of successful delivery of the file by time r is thus 

K+H 

P s (t,K,H)= J2 B{Di{T),K + H,j), 

where Di(r) = 1 — cxp(— A JJ" Xi(s)ds) is the probability 
that frame i is successfully received by the deadline. 
We assume below that the source has frame i available at time 
ti where i — 1, K + H, In particular, ti may correspond 
to the arrival time of the original frames i = 1 , . . . , K at the 
source. For the redundant frames, ti may correspond either 
to (i) the time at which the redundant frames are created by 
the source, or to (ii) the moments at which they arrive at the 
source in the case that the coding is done at a previous stage. 

A. Main Result 

Let Zi = J T Xi(v)dv, where i = 1, 2, K + H. 
Theorem 6.1: (i) Assume that there exists some policy u 

such that X)i=i u i(t) = 1 f° r a U t> ar, d such that Zi is the 
same for all i = 1, K + H under u. Then u is optimal for 
P2. 

(ii) Algorithm A, with K + H replacing K, produces a policy 
which is optimal for P2. 

Proof: (i) Let A(K, H) be the set of subsets h C {1, K + 
H} that contain at least K elements. E.g., {1,2,..., A'} 6 
A(K, H). Fix pi such that pj = u. Then the probabil- 

ity of successful delivery by time r is given by 

p s (t,k,h)= n^) 

h£A(H,K) i£h 

For any i and j in {1, K + H} we can write 

P s (t,K,H) = t(Zi)C(Zj)gi + (C(Zi) + ((Z 2 ))g 2 + g 3 

where gx, g 2 and g 3 are nonnegative functions of 

{Z(p m ),m ^ i, m ^ j}. E.g., 

where A V (K,H) is the set of subsets h C {1, K + H} 
that contain at least K elements and such that v C h. Now 



consider maximizing P s (r, K, H) over Zi and Zj Choose 
some arbitrary policies q and let Z'(q) = JJ" X(v)dv. Assume 
that Z[ 7^ Z'y Since £(•) is strictly concave, it follows by 
Jensen's inequality that + can be strictly improved 
by replacing Z[ and Zj) by Z t = Zj = {Z[ + Z' j )/2. This is 
also the unique maximum of the product C,{Zi)(^(Zj) (using 
the same argument as in eq. ©), and hence of P s (t, K, H). 
Since this holds for any i and j and for any p' < p, this 
implies the Theorem. 

(ii) Algorithm A maximizes the probability that K + H frames 
are received under a work conserving policy: the statement 
hence follows observing that P s (t, K, H) is monotonically 
not decreasing in H. o 
Remark 6.1: If the source is the one that creates the redun- 
dant frames, then we assume that it creates them after tx- 
However, it could use less than all the K original frames 
to create some of the redundant frames and in that case, 
redundant frames can be available earlier. E.g., shortly after t 2 
it could create the xor of frame 1 and 2. We did not consider 
this coding policy and such option will be explored in the 
following sections. 

In the same way, the other results that we had for the case 
of no redundancy can be obtained here as well (those for PI, 
CP1 and CP2). 

VII. Rateless codes 

In this section, we want to identify the possible rateless 
codes for the settings described in Section II, and quantify the 
gains brought by coding. In the reminder, information frames 
are the K frames received at the source at t\ < t% < ■ ■ ■ < tx- 
The encoding frames (also called coded frames) are linear 
combinations of some information frames, and will be created 
according to the chosen coding scheme. Rateless erasure codes 
are a class of erasure codes with the property that a potentially 
limitless sequence of encoding frames can be generated from 
a given set of information frames; information frames, in turn, 
can be recovered from any subset of the encoding frames of 
size equal to or only slightly larger than K (the exceeding 
needed frames for decoding are named "overhead"). As in 
the previous section, we assume that redundant frames are 
created only after tx, i.e., when all information frames are 
available. The case when coding is started before receiving all 
information frames is postponed to the next section. Since en- 
coding frames are generated after all information frames have 
been sent out, the code must be systematic because information 
frames are part of the encoding frames. A code is maximum- 
distance separable (MDS) if the K information frames can be 
recovered from any set of K encoding frames (zero overhead). 
Reed-Solomon codes are MDS and can be systematic. Notice 
that the analysis of such codes is encompassed in Section III, 
since they add a fixed amount of redundancy. Let us now 
analyze what are the rateless codes which can be used in this 
setting, i.e., which are systematic. LT codes [9], which are 
one of the efficient class of rateless codes, are non-systematic 
codes. In this context, "efficient" means that the overhead can 
be arbitrarily small with some parameters. Raptor codes [12] 
are another class of efficient rateless codes, and systematic 
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TABLE IV 
Algorithm C 



CI Use pt = e\ at time t £ [t\,t2). 

C2 Use pt = e2 from time t2 till s(l,2) = 
min(S(2, {1, 2}), t 3 ). If s(l,2) < t 3 then switch to 
Pt = \{ei + e2) till time t$. 

C3 Repeat the following for i = 3, K — 1: 
C3.1 Set j = i. Set s(i,j) = 



C3.2 Use pt 



efc from time s(i,j) till 
i}),t i+1 ). If j = 1 



i+l-j ^k=j 

s(i,j-l) :=min(S(i,{l,2 
then end. 

C3.3 If s(i,j — 1) < then take j = min(j : j S 

J(i, {1, «})) and go to step [C3.2]. 
C4 From t = tx to i = r, use all transmission opportunities 
to send a random linear combination of information frames, 
with coefficients picked uniformly at random in F 9 . 



Raptor codes have been devised. Network codes [10] are more 
general rateless codes as the generation of encoding frames 
relies on random linear combinations of information frames, 
without no sparsity constraint for the matrix of the code. These 
codes are MDS with high probability for large field size (and 
consequent complexity). LT or Raptor codes are only close to 
MDS, e.g., LT codes are MDS asymptotically in the number 
of information frames. In fact, they are aimed to reduce the 
encoding and decoding complexity. That is why in this section 
we provide the analysis of the optimal control for network 
codes. But, it is straightforward to extend these results to 
systematic Raptor codes. 

Let us determine what is the optimal policy u for sending the 
information frames, when network codes are used to generate 
redundant frames after tK- After t^, at each transmission 
opportunity, the source sends a redundant frame (a random 
linear combination of all information frames) with probability 
u. Indeed, from tx, any sent random linear combination 
carries the same amount of information of each information 
frame, and hence from that time, the policy is not function of a 
specific frame anymore, whereby u instead of u. In each sent 
frame, a header is added to describe what are the coefficients 
of each information frame in the linear combination the 
encoded frame results from. For each generated encoding 
frame, the coefficients are chosen uniformly at random for 
each information frame, in the finite field of order q, ¥ q . 
The decoding of the K information frames is possible at the 
destination if and only if the matrix made of the headers of 
received frames has rank K. In the following, we shorten this 
expression by saying that the received frames have rank K. 
Note that, in our case, the coding is performed only by the 
source since the relay nodes cannot store more than one frame. 
Recall the definition Zi = /J" Xi(v)dv, i = 1, . . . , K — 1. 

Theorem 7.1: Let us consider the above rateless coding 
scheme for coding after tj<- 

(i) Assume that there exists some policy u such that 

Y^i^i. 1 u i(t) = 1 f° r a U an d sucn tnat ^ is the same for 
all i = 1, . . . , K — 1 under u. Then u is optimal for P2. 

(ii) Algorithm C produces a policy which is optimal for P2. 



Proof: Let E be any set made of pairwise different elements 
from {1, . . . , K — 1}. We have 



Ps(r) 



BC{l,..,Jf-l} \i£E 



where Q(E) denotes the probability that the received coded 
frames, added to the received information frames, form a rank 
K matrix. Let e denote the number of elements in E, and P m 
be the probability that exactly m coded frames are received 
at the destination by time r. Let consider the probability that, 
given that m > K — e coded frames have been received, these 
frames form a rank K matrix with the received e information 
frames. We lower-bound this probability by the probability 
that only K — e coded frames form a rank K matrix with the 
e frames, and this probability corresponds to the product term 
in the following equation. Then we can lower-bound Q(E) by 



(K-e-l \ K-e-1 
l- £ Pm) n 
m=0 / r=0 



1 - 



1 



,K-(r+e) 



(9) 



Let us now express P m . Let Yff(i) denote the proportion 
corresponding to the number of coded frames released at time 
t and A be defined by A = A J Q T Y K (t) dt. We have P m = 
exp(— A)^y. Let Y(t) denote the proportion corresponding 
to the total number of frames in the network at time t: 

K 

Y(t) = X(t) + Y K {t) = £ X k {t) + Y K (t) . 
fe=i 

Since coded frames are released only after tx, Yx{t) = for 
t < tx- We can consider that Xxit) = for any t as coded 
frames are sent as soon as all the information frames have 
been received by the source. Thus, for k < K 



dX k (t) 
dt 



Xu k (t)(l-Y(t)) 



with Y(t) = X(t) for t < tx- Thus, for t < tx, equations 
dTJ to d4]> remain unchanged. For t > tx, X(t) — X(tx) and 
X k (t) = Xk(tx) for k < K. Hence, for t>tx, we have 



dY K {t) 
dt 



\u{\ - X{t K ) - Y K {t)) 



with Y K (t K ) = 0. Thus we get Y K (t) =0, Vt < t K , 
Y K {t) = (1 - X{t K )){\ - cxp(-Au(t - t K ))), Vt>t K - 
Finally 

A = A(l - X{t K )){ T - t K - -L + -L exp(-Au(r - t K )))) . 

AU AU 

Hence P s (r) > £ {( IJCC^O ) x 

EC{1,...,K-1} \ieE / 



\ K-e-1 



(10) 



g 



Thus, to maximize P s (t) for u = 1 in terms of the Zi, i = 
1, . . . , K — 1, it is sufficient to maximize its lower bound in 
terms of the Zi. From eq. (fTOl) . for maximizing the lower 
bound, we can see that the proof of Theorem |6. ll carries almost 
unchanged, as the second product term in eq. ( [Tol l results in 
weighting constants in front of each product in the summations 
of g\, (?2 and g^. Hence, the success probability is maximized 
when all the Z% are the same for all i = 1, . . . , K — 1. 

VIII. Rateless codes for CODING BEFORE tx 

We now consider the case where after receiving frame i 
and before receiving frame i + 1 at the source, we allow 
to code over the available information frames and to send 
resulting encoding frames between ti and tj+i. LT codes 
and Raptor codes require that all the information frames are 
available at the source before generating encoding frames. 
Due to their fully random structure, network codes do not 
have this constraint, and allow to generate encoding frames 
online, along the reception of frames at the source. We present 
how to use network codes in such a setting. The objective is 
the successful delivery of the entire file (the K information 
frames) by time 70. Information frames are not sent anymore, 
only encoding frames are sent instead. At each transmission 
opportunity, an encoding frame is generated and sent with 
probability u[t). Note that u is not relevant anymore because, 
at each transmission, network coding allows to propagate an 
equivalent amount of information of each of the frames in the 
source buffer, by sending a frame which is a random linear 
combination of all buffer frames. This is detailed later on. 

Theorem 8.1: (i) Given any forwarding policy u(t), it is 
optimal, for maximizing P s (t), to send coded frames resulting 
from random linear combinations of all the information frames 
available at the time of the transmission opportunity, 
(ii) For a constant policy u > 0, the probability of successful 
delivery of the entire file is lower-bounded by 



k 2 



K-l 



K 



£ Jf(k,h 



j=0 k^>->k } t =K-ki lj=K-Y. j i Zlk 



with f(l,k) 



Pi,k,iD k ,i(r), 



if I < k, 



^,fc,fcfl-E™io^, m (r)) , ifl = k 



and P w = nUo 1- 3>M. D kA?) = exp(-A fc )^, and 



A 



K 



X 



cxp( 



-Xut 



1 \ 1 / X > 

t K - — + — exp(-AMT ; 
Am / Am 



Proof: For all k = 1, . . . , K, let E(k) be E(k) = {1, . . . , k}. 
For sake of shorter notations, we say that a coded frame is "a 
frame over E(k)" if the coefficients of the first k information 
frames are chosen uniformly at random in F 5 , while the others 
are zero. We analyze first the probability of successful delivery 
of the file by time r, i.e., the probability of decoding of the K 
information frames. Let us first briefly discuss the general case, 
following the formalization in [10]. As previously mentioned, 

2 We do not have constraints on making available at the destination a part 
of the K frames in case the entire file cannot be delivered. 




K 




(a) 



(b) 



Fig. 3. Received encoding matrices, (a) The decoding fails because all the 
information frames cannot be recovered - matrix has not full rank, (b) The 
decoding is successful - matrix has full rank. 



the decoding is successful if the matrix of received coded 
frames has rank K. When no coding is used, the matrix 
of received uncoded frames can be only the identity for the 
decoding to be possible. Hence, if a frame is lost, only the 
same frame can recover the loss. However, when coding is 
used, we can send coded frames which are random linear 
combinations of all K information frames. Then, if any frame 
is lost, the rank of the received matrix results into K — 1: in 
order to get a mnk-K matrix it is sufficient to receive an extra 
coded frame which is independent of all previously received 
ones, i.e., dependent on the lost frame. This is known to 
happen with high probability as soon as q is large enough [10]. 
Let us now formalize the successful decoding conditions for 
our problem. As illustrated in Figure [3] we have the following 
definitions: 

• The received frames are over E(ki), with K = fco > 
fci > k 2 > ■ ■ ■ > kj > 1. 

• j is such that < j < K, and denotes the number of 
pairwise different ki ^ K, i = 0, . . . , j. We set kj+i = 0. 

• U, i = 0, . . . , j is the rank of received frames over E(ki). 
For the coding matrix to be rank K, i.e., for the decoding to 
be successful, it is necessary and sufficient to have (see Fig [3j: 

l > K - ki 

h>k 1 -k 2 -{lo-(K-k 1 )) 
: n-i 

l n > K — k n+ i - E h 

i=0 



J'-l 



For all i = 0, . . . ,j, i.e., for all states (number of available 
information frames) the source is in when transmitting, Zj is 
given by the number of transmission opportunities. Hence, 
to maximize the successful decoding probability, each ki, 
for all i = 0, has to be maximized. This means 

exactly making random linear combinations of all available 
information frames. Let us now express the probability of 
successful delivery of the file by time r, i.e., the probability 
of decoding of the K information frames. Let Yfc(i) be the 
fraction of nodes (excluding the destination) having a frame 
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over E{k) at time t. Let -Dfc,i(r) be the probability that 
exactly i frames over E(k) be received at time r: 



A 1 

DkA T ) = exp(-A fc )-f , 



A 



A 



exp(— Xutji) 



t K 



+ - — cxp(— Aur) 



where A& = A /J" Yk (t) dt. By unfolding calculations (see 
Appendix), we get 

—)+ — 

Xu J Am 

Then, by using some approximations (see Appendix) which 
are tight when q is large enough (e.g., when the frame size 
is a byte, i.e., q = 2 8 ), we obtain the lower-bound on P s (t). 
oLet us briefly compare the successful delivery probabilities 
for the different coding schemes: 

. No coding: P s (t) = C(^) 

• Adding fixed amount of redundancy: 

p s (t,k,h)= n^) 

heA(H,K) i£h 

• Coding after t^'- 

K K-e-1 K-e-1 -. 

^)>E{(V E p ™) n ^-^r?)) 

— n m=0 r=0 v y 

E (II era) } 



£C{l,.,Jf) i€E 

> Coding before tx '■ 

K-l K 

w>E E E ■■• 

j=0 fci>— >fcj Z = A"-fci 



E !/&>*.•) 



i=0 



Coding with rateless codes after tx allows to need an equal- 
ization of the Zj only for i = l,...,K — 1, i.e., for the 
information frames but not for the coded frames, unlike the 
scheme with fixed amount of redundancy. Coding before tx 
avoids the need for any policy u for each frame in order to 
equalize the Zi. This is due to the fact that, when transmitting 
a single coded frame, network coding allows to propagate 
an equivalent amount of information of each information 
frame, thereby circumventing the coupon collector problem 
that would emerge with single repetition of frames. Algorithm 
A addresses this problem by striving to equalize the Zi. 
Hence, even though all the frames over E(ki) do not reach 
the destination, it is sufficient to receive more frames over 
E(kj), j > i, to recover the file. We conjecture that such a 
network coding scheme may have a critical gain, compared to 
the uncoded strategy, especially when the mobility model is 
not random, as assumed in Sec. II. 

IX. Conclusions 

In this paper we addressed the problem of optimal trans- 
mission policies in two hops DTN networks under memory 
and energy constraints. We tackled the fundamental scheduling 
problem that arises when several frames that compose the same 
file are available at the source at different time instants. The 
problem is then how to optimally schedule and control the 
forwarding of such frames in order to maximize the delivery 
probability of the entire file to the destination. We solved this 
problem both for work conserving and non work conserving 
policies, deriving in particular the structure of the general 
optimal forwarding control that applies at the source node. 



Furthermore, we extended the theory to the case of fixed 
rate systematic erasure codes and network coding. Our model 
includes both the case when coding is performed after all the 
frames are available at the source, and also the important case 
of network coding, that allows for dynamic runtime coding of 
frames as soon as they become available at the source. 
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X. Appendix 
A. Proof of Theorem 031 

(i) The policy u* generated by Algorithm A is work 
conserving by construction. Let Z(r) and Z* (r) denote the K- 
dimensional CCI vectors corresponding to a work conserving 
policy u and to u*, respectively. We show in the following 
that it holds Z*(r) -< Z(r) for r > 0. Then Thm. [3721 implies 
that P*(t,u) > P s (r, u'), i.e., u* is uniformly optimal over 
work conserving policies. It is now immediate to observe 
that u* is optimal for P2 because it minimizes the expected 
delivery delay E[D] = / °°(1 - P s (t))dt. We now prove that 
Z*(t) -< Z(r) for Vr > 0. 
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Let Z(t) (resp. Z*(t)) be the CCI of u (resp. u*) at time t. 
It is sufficient to show that Z*(t) -< Z(t) for any t > 0. u* is 
generated by Algo A such that for all t: 
• u* maximizes the minimum of the CCIs: 



u* = are max min Z, (t) 

WC ui:ti<t 

u* minimizes the highest gap between two CCIs 



(ID 



u* = arg min max |Zj(t) — ZAi)\ (12) 

WC ui 7 j:ti,tj<t 

For lighter notations, we omit t as well as ti, tj < t when we 
refer to any i or j in the remainder of the proof. We have 



K 

E 



K 

Z li] = E H 
i=l 



We want to prove that 



k k 

z ® ^ E z k 

i=l i=l 



Vfc = 1, . . . , K — 1. Owing to property ( ITTb of u*, we have 
mini Zi < min, Z*, i.e., Z[x] < Z[k]- Thus, let si and s 2 be 
such that: 



si < i, 



z \i\ < z \i\ 



s 2 < i < si, Zj*j < Z[j] 

« < s 2 , Z [t] < Z[*j 

Let us prove by contradiction that s 2 does not exist. If s 2 
exists, then Z^j < Zr*,. Since 

max |Zj(i) - Z,-(i)| = Zm - Z [A1 
we would have then 

ry ry* ^ ry* ry* 

Z [l] _ Z [A] S ^[1] - *[K] 

which means that u* does not satisfies property (12\ anymore. 
Hence, we cannot have s 2 > 2. Thus we have: 

si < i, Z\n < zr. 



1 < i < si, Zj*] < Z M 



• k > si — 1: 



Owing to the definition of si, 53j=fe+i ^[«] — 



Y,*Lu,i Z? a , therefore 



fe 

E 



A' 



A" 



= E Z M ~ E Z M - 

1 i=l i=k+l 

K K k 

E z w ~ E z k = E Z H 

i— 1 z— fc+1 2—1 



fc > si - 2: 



For all i = l,...,k, Z w > Zfa, hence X)i=i z [i] > 
i=i z [i]- 



Thus, Z* -< Z and this ends the proof of (i). 

(ii) The proof is immediate from Thm. 13.11 o 

B. Alternative proof of Theorem 13.51 

In what follows we propose an alternate proof of the 
statement (i) of Theorem [331 which also is descriptive of the 
way Algorithm A works. We will need the following 

Lemma 10.1: Let x,y € R" and let j = arg min xj, n > 
j > 1, where xi > x 2 > . . . Xj-i > Xj. Assume S < — 
Xj) where J = n — j + 1 and let x' = (xi,x 2 , ...,Xj + 
6/],. . . ,Xj + 8 /f). Let also u 6 R" such that ^ u i = <5 an£ l 
y' = y + u. Then, 



x -< y =^ x' -< y' 



Proof. The key observation is that x' = (xyyuXn], 



+ 



8/1, 



■6/j) because J/j < (x^-i 



Let us assume 



by contradiction x 1 y', i.e., there exists 1 < w < K — 1 
such that 



w — l w—1 

^4^E 

/i=i 



a=i 



yffc] < E x {h] > E y w 

h=l 



h=l 



Weed, it must be w > j (if not, we would contradict x -< y). 
Now we use an argument involving piecewise linear functions 
defined on [0, n) built as described in the following. Let a 6 

R" and 4> a (t) = J2r=i a r^ f° r t = 0, . . . , n whereas <p a (t) = 
^o(^ — 1) + a m ■ (t — m) for m — 1 < t < m, m = 1, . . . , K . 
Notice that if a n is not increasing, then <j> a {-) is convex. Now, 
we observe that 

4>x'(t) = <t> x '(j-l) + (x j + 8/j)t 

for j - 1 < t < K. Furthermore, it holds 0^(0) < </» y '(0) 
and <f> x t (K) = <j) y i (K) from the assumptions. Thus, due to the 
continuity of <p y '(-), there exists interval I C [j — 1, K], such 
that W £ I, <px' > 4>y> in the interior of /, and <p x > = (\> y > at the 
end points. Since, <j> x '{t) is a straight line over /, we obtain 
that (f) y > is strictly concave in /, which is impossible because 
yL is decreasing by definition. Hence, our assumption is false 

and it must be x' <y' . o 
Proof. We follow the same conventions on the symbols used 
before; again, we prove that at any time r, Z*(r) -< Z(r) 
so that Thm. 13.21 let us state uniform optimality over work 
conserving policies, i.e., P s (t,u*) > P s (r,u) from which 
optimality for P2 is immediate. 

Here, we proceed by induction on the number of frames 
K. The induction basis is indeed verified for K = 2: 
Z(r) = Z*(r) = Z x (t) = Z*(t) for < r < t x , since 
both policies are work conserving and frame 2 arrives at time 
t 2 . Hence, it means that Z*(r) < Zi(r) for t 2 < r < t eq 
because u*(t) = over such interval, whereas Z*(r) = Z(r) 
since both policies are work conserving. For t > t eq , it is 
sufficient to recall the general relation [11, pp. 7] that holds 
for any n-ple of nonnegative real numbers (eti, . . . , a n ) such 
that a i = 1 ; 



(-,...,-) -< (ai, ...,a n ) 



II 



which implies, for any work conserving equalizing policy Z* 



1 



1 



■<Z-(a U ---,a„) = Z (13) 



where Z = %i = E£i 

Now assume that majorization holds for K — 1 and consider 
time tx when the K -th frame arrives: the inductive assumption 
let us conclude that Z*(t) -< Z(r) for all r < iR-. Also, 
by construction, there exists time t* > tx such that u* 
attains equalization: for t > t* the statement is indeed verified 
according to eq. (1131) . Now, we need to verify the statement 
for t £ [tic, £*]• During interval [i^-, s(iT, if — 1)), however, 
Z*(t) = Z*(t K ) and Zi(t K ) < Z t (r) for » = 1, . . . , K - 1. 
Hence, since Z*(£^) -< Z(tR-) 



if-i 

E 



if-1 



if-1 



Z* h (r) = J2 Z *h(tK) < E ^ E Z '^ T ) 



h=l 



h=l 



h=l 



so that Z*(r) -< Z(r) for r G [t Kl s(K,K - 1)). Let us 
now consider r G [s, s'), where s' = s(K,K — j — 1), and 
it holds j = min(j : j € J(s, {1, if })) according to the 
algorithm. Obviously, if j = 1, it holds t* = s and we are 
done. Otherwise, 1 < j < K . In this case, observe that for 
T G [s, s') 

Z*(t) = . . . , Z*(s)+6Z*, Z*(s)+5Z*) 

where JZ* = (Z(r) - Z(s))/(n-j + 1) and Z(r) - Z(s) is 
the increment in Z of any work conserving policy in (s,s']. 
Notice that, according to Algorithm A, 5Z* < (n — j + 
l)(Zj-i(s) — Zj(s)). Hence, it follows from Lemma W 0.1 1 that 
Z*(t) -< Z(r) for r G [s, s'). Over subsequent intervals, the 
same reasoning done for r G [s, s') holds unchanged, which 
concludes the proof because we showed that Z*(t) -< Z(t) 
for all t > 0. 

C. Proof of Theorem |4jJ 

Proof. Let u be an arbitrary policy and Xj, i = 1,2 the 
corresponding dynamics. During time [£1,^2) only frame 1 
is available, so clearly U2 = until time ti\ also, denote 
£i = Xi(r). Consider one-dimensional cases: it is known 
from [1] that the policy that maximizes J„ Xi(t)dt among 
those that achieve the same constraint Xi(r) = fj, i = 1,2 
is necessarily a threshold one [1]. Denote Sj the thresholds 
that achieve the same constraint £j, i = 1,2 in the one- 
dimensional case: optimal threshold policies have minimum 
support, so that < Si + S2 < r. Hence, we can construct the 
following policy u' : u' = (1,0) for < t < Si, u' = (0, 1) 
for min(t2,si) < t < S2, and u' = (0,0) otherwise. It 
must be P s (r, u) < P s (r, u') (otherwise we would incur into 
contradiction with the result [1] in the 1-dimensional case). We 
observe that in case si + S2 < t, u' P s (r, u') can be further 
increased by simply letting S2 = t—si, so that u' = u'(si): in 
order to complete the proof we only need to prove that si < 
and that such a policy is then unique. By contradiction: let us 
assume s\ > £2 and observe that u' is work conserving, but 



this contradicts Thm 13.41 so that si < t%. 
We conclude that u'(sx) has the structure claimed in the 
statement. Finally, we observe that s\ is indeed unique, 
since Xi{r) is determined by the difference r — ti, which 
corresponds to a unique value Xi(j) (note that it must be 
t\ > t — t2 otherwise equalization would be possible via 
a work conserving policy), the explicit expression for the 
threshold is obtained imposing X(s) = 1 — X(t — ty). 



D. End of proof of Theorem |8J 

Proof. Let us express A k . Let Y(t) be Y(t) = J2k=i Y k{t)- 
A fluid approximation can be applied to Y(t) and Yfe(<), k 
1,...,K: 

dY(t) 



dt 



= u\(l - Y(t)) 



If we consider Y(0) = 0, we get Y(t) = 1 — cxp(— Xut). 
Then we have, for k < K , Y k (t) = 0, Vi < t k , 



dY k (t) 
dt 



= uA(l - Y(t)), Vi fc < t < t k+1 , 



Y(t) = Y(t k+1 ), > tfc+j , 
Thus for k < K we get 

Yfc(t) = cxp(-Auifc) - exp(—\ut), Vt fc < t < t k+1 , 
Y k (t) = cxp(-Auifc) - cxp(-Aut fe+ i), Vi > t k+1 , 
and Yft-(i) = exp(— \utic) — exp(-Xut), Vt k < t . 



Thus for k < K A k = \[cxp(—\ut k ) [ 
1 



1 

Xu 



A A ' = A 
Then 



exp(-Aui fe+ i) 
exp(— Xutfc) I 



x-i 



Aw 
-t K 

K 



tk+i - r ] , and 



1 



+ - — cxp(— Xut) 
Xu 



^) = E E E 

3=0 fci>— >fcj l =K-ki 



e n««. 



where is the probability that the received frames over E(ki) 
have rank ij. Let P m ,k,l be the probability that to rows (headers 
of frames) of size k have rank at least I < k, when the elements 
are chosen uniformly at random in F„. Thus 

K-l K 



p s(r)-J2 E E 

j—0 ki>--->kj lo—K—ki 



n e^ 



1 



■ E 

D ki , m (T)) ■ 



i—0 \m—Li 

When q is large enough (e.g. q = 2 8 ): 



12 



• If k = h, then J2m=l, P m,kiM D kum > 
PkiMM U - Em=o -Cfei.mCr)) and the higher 
the tighter this lower-bound, since the probability that 
random vectors on F^" be linearly independent tends to 
1 as q tends to infinity. 

. If k < ki, P m ,ki,k and Pm-l,ki,h on one hand ' and 

-Dfei,m( T ) and £)fc i>m _i(T) on the other hand, are of 
the same order for high q, while (l/q ki ~ li ) m ' ! << 
(l/q ki ~ l *) m 1 *. That is why we use the lower-bound: 

oo j ^ \ m— U 



m—li 



Also, note that the higher q, the tighter this lower-bound. 
Hence we have the following lower-bound: 

Ps(t)>J2 ££•••£ n/ft,*.), 

f Pi,k,iD k ,i(r), if J < ft, 

Wlth/(Z ' fc)= ^, fe fl-Etrio^(r)), ifl = * 



where P;,fc,; = n!-=o — ' ^ ue to tne a ^ ove n °tes, the 

higher q, the tighter this lower-bound on P s (t). o 



